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Abstract. In this article we develop convergence theory for a class of goal-oriented 

adaptive finite element algorithms for second order semilinear elliptic equations. We 

first introduce several approximate dual problems, and briefly discuss the target prob- 

OQ lem class. We then review some standard facts concerning conforming finite element 

discretization and error-estimate-driven adaptive finite element methods (AFEM). We 
include a brief summary of a priori estimates for semilinear problems, and then describe 
goal-oriented variations of the standard approach to AFEM (GOAFEM). Following the 
recent approach of Mommer-Stevenson and Hoist-Pollock for linear problems, we first 
establish a contraction result for the primal problem. We then develop some additional 
<^5 estimates that make it possible to establish contraction of the combined primal-dual 

\f\ quasi-error, and subsequently show convergence in the sense of the quantity of inter- 

est. Numerical experiments support the theoretical results. 
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In this article we develop convergence theory for a class of goal-oriented adaptive 
finite element methods for second order semilinear equations. In particular, we establish 
strong contraction results for a method of this type for the problem: 

> / N(u) := -V • (AVu) + b(u) = f, in fi, 



■)«) (1-1) 

^ u = 0, on oil, 

with / G L 2 {yt) and Vt C M. d (d = 2 or 3) a polyhedral domain. We consider the problem 
• with A: Vl — > M. dxd Lipschitz and almost-everywhere (a.e.) symmetric positive definite 

(SPD). The standard weak formulation of the primal problem reads: Find u e Hq(Q) 
<N such that 



(Ar(u),v):=a(u,v) + (b(u),v) = f(v), Vv 6 H^Q), (1.2) 



S^ where 

03 a(u,v) = / AVu-Vvdx. (1.3) 

Jn 
In many practical applications, one is more interested in certain physical quantities 
of the solution, referred to as "quantities of interest", such as (weighted) averages, flow 
rates or velocities. These quantities of interest are usually charactered by the value g(u), 
where u is the solution and g is a linear functional associated with a particular "goal". 
Given a numerical approximation Uh to the solution u, goal-oriented error estimates use 
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duality techniques rather than the energy norm alone to estimate the error in the quan- 
tity of interest . The solution of the dual problem can be interpreted as the generalized 
Green's function, or the influence function with respect to the linear functional, which 
often quantifies the stability properties of the computed solution. There has been a lot of 
recent work on developing reliable and accurate a posteriori error estimators for goal- 
oriented adaptivity, see [12, 4, 5, 14, 35, 13, 18, 19, 30] and the references cited therein. 

Our interest in this paper is developing a goal-oriented adaptive algorithm for semi- 
linear problems (1.2) along with a corresponding strong contraction result, following the 
recent approach in [32, 23] for linear problems. One of the main challenges in the non- 
linear problem that we don't see in the linear case is the dependence of the dual problem 
on the primal solution u. As it is not practical to work with a dual problem we cannot 
accurately form, we develop a method for semilinear problems in which adaptive mesh 
refinement is driven both by residual-based approximation to the error in u, and in a se- 
quence of approximate dual problems which only depend on the numerical solution from 
the previous step. While globally reducing the error in the primal problem necessarily 
yields a good approximation to the goal error g(u — Uh), methods of the type we de- 
scribe here bias the error reduction in the direction of the goal-function g in the interest 
of achieving an accurate approximation in fewer adaptive iterations. 

Contraction of the adaptive finite element algorithm for the (primal) semilinear prob- 
lem (1.2) has been established in [25] and [22]. Here we recall the contraction argument 
for the primal problem and use a generalization of this technique to establish the con- 
traction of a linear combination of the primal and limiting dual quasi-errors by means of 
a computable sequence of approximate dual problems. We relate this result to a bound 
on the error in the quantity of interest. Following [25], the contraction argument follows 
from first establishing three preliminary results for two successive AFEM approxima- 
tions Mi and u 2 , and respectively z\ and z 2 of the primal and limiting dual problems. 

1) Quasi-orthogonality: There exists A G > 1 such that 

III III 2 ^ a III III 2 III III 2 

||p — W2||| ^ Aq I U — Wi| — ||p2 — Wl||| • 

2) Error estimator as upper bound on error: There exists C\ > such that 

l\u-u k \l 2 <CiT]l(u k ,T k ), A; = 1,2. 

3) Estimator reduction: For M the marked set that takes refinement 71—7-72, for 
positive constants A < 1 and Ai and any 5 > 

Vl(v2, Ti) < (1 + 5){vKvi, Tx) - Xvl(v x ,M)} + (1 + r^Ai^lh - ui||. 

For the primal problem, the mesh at each iteration may be marked for refinement with 
respect to the error indicators following the Dorfler marking strategy (cf. [11]). In the 
case of the dual, the limiting estimator as used in the contraction argument is related to 
a computable quantity. This quantity is the dual estimator, based on the residual of the 
approximate dual sequence. The mesh is marked for refinement with respect to this set of 
error indicators, which correspond to the approximate dual problem at each iteration. The 
transformation between limiting and approximate dual estimators couples the contraction 
of error in the limiting dual to the primal problem. The final result is the contraction of 
what we refer to here as the combined quasi-error 

Q 2 (uj, ij) := p - Zj\\ 2 + jCi(Zj) + n l\ u ~ u j\\ 2 + 7r "Yp r l2( u j)i 

which is the sum of the quasi-error as in [8] for the limiting dual problem and a multiple 
of the quasi-error for the primal problem. The contraction of this property as shown in 
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Theorem 5.9 establishes the contraction of the error in the goal function as shown in 
Corollary 5.10. 

Our analysis is based on the recent contraction framework for semilinear and more 
general nonlinear problems developed by Hoist, Tsogtgerel, and Zhu [25], and by Bank, 
Hoist, Szypowski and Zhu [2], and those for linear problems developed by Cascon, 
Kreuzer, Nochetto and Siebert [8], and by Nochetto, Siebert, and Veeser [34]. In ad- 
dressing the goal-oriented problem we base our framework on that of Mommer and 
Stevenson [32] for symmetric linear problems and Hoist and Pollock [23] for nonsym- 
metric problems. We note also two other recent convergence results in the literature 
for goal-oriented adaptive methods applied to self-adjoint linear problems, namely [10] 
and [33], both providing convergence rates in agreement with those in [32]. The analysis 
of the goal-oriented method for nonlinear problems is signficantly more complex than 
the previous analysis for linear problems in [32, 23]. Here, we are faced with analyzing 
linearized and approximate dual sequences as opposed to a single dual problem in order 
to establish contraction with respect to the quantity of interest. However, this approach 
allows us to establish a contraction result for the goal-oriented method, which appears to 
be the first result of this type for nonlinear problems. The linearized dual in the context 
of goal-oriented adaptive methods is described below, following e.g. Estep et. al in [14] 
and [15]. 

Outline of the paper. The remainder of the paper is structured as follows. In §2, we 
introduce the approximate, linearized and limiting dual problems. We briefly discuss 
the problem class and review some standard facts concerning conforming finite element 
discretization and error-estimate-driven adaptive finite element methods (AFEM). In §2.2 
we include a brief summary of a priori estimates for the semilinear problem. In §3, we 
describe a goal-oriented variation of the standard approach to AFEM (GOAFEM). In §4 
we discuss contraction theorems for the primal problem. In §5 we introduce additional 
estimates necessary for the contraction of the combined quasi-error and convergence in 
the sense of the quantity of interest. Lastly, in §6 we present some numerical experiments 
that support our theoretical results. 

2. Preliminaries 

In this section, we state both the (nonlinear) primal problem and its finite element 
discretization. We then introduce the linearized dual problem, and consider some variants 
of this problem which are of use in the subsequent computation and analysis. 

Consider the semilinear problem (1.2), where as in (1.3) we define the bilinear form 

a(u,v) = (AVu,Vv) 

with (•, •) denoting the L 2 inner-product over fi C W 1 . We make the following assump- 
tions on the data: 

Assumption 2.1 (Problem data). The problem data D = (A, b, f) satisfies 

1) A : Q — > M. dxd is Lipschitz continuous and a.e. symmetric positive-definite with 

ess inf xen X min (A(x)) = p > 0, 
ess sup xm \ max {A(x)) = \x x < oo. 

2) b : Q x R — >■ R is smooth on the second argument. Here and in the remainder 
of the paper, we write b{u) instead ofb(x, u) for simplicity. Moreover, we assume 
that b is monotone (increasing): 

&'(£) >0, for all (el. 
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3) fe L 2 (Q). 

The native norm is the Sobolev H 1 norm given by \\v \\ 2 H1 = (Vt>, Vv ) + (v, v). Con- 
tinuity of a( ■ , • ) follows from the Holder inequality, and bounding the L 2 norm of the 
function and its gradient by the H 1 norm 

a(u,v) < fii\\u\\ H i\\v\\ H i = M £ \\u\\ H i\\v\\ H i. (2.1) 

Define the energy semi-norm by the principal part of the differential operator \\v\\ 2 := 
a(v , v ). The coercivity of a( • , • ) follows from the Poincare inequality with constant Cq 

a(v,v) > iu, \v\ 2 H i > C n fi \\v\\ 2 H i = m 2 s \\v\\ 2 H i, (2.2) 

which establishes the energy semi-norm as a norm. Putting this together with (2.1) es- 
tablishes the equivalence between the native and energy norms. 

2.1. Linearized dual problems. Given a linear funcitonal g e L 2 {Vt), the objective 
in goal-oriented error estimation is to relate the residual to the error in the quantity of 
interest. This involves solving a dual problem whose solution z satisfies the relation 
g(u — u h ) = (R(u h ), z). In the linear case, the appropriate dual problem is the formal 
adjoint of the primal (cf. [31, 23]). For b nonlinear, the primal problem (1.2) does not 
have an exact formal adjoint. In this case we obtain the dual by linearization. Formally, 
given a numerical approximation Uj to the exact solution u, the residual is given by 

R( Uj ) ■= f - X(Uj) = Af(u) - M(Uj). 

If z^ G Hq (tt) solves the following linearized dual problem 

a(z j , v) + (B jZ 1, v) = g(v), Vv e H^Q), (2.3) 

where g(v) := J n gvdx and the operator Bj is given by 

B 3 := / b\iu + (1 - i)u 3 ) di= [ b'iuj + (u- u,)0 de, (2-4) 

Jo Jo 

then the goal-oriented error g(ej) can be represented exactly by the inner product of z'-> 

and R(uj): 

g(ej) = (R(uj),z 3 ). 

In fact, by definition of the residual R(uj), we have 

(z j , R(uj)) = a(z j , Cj) + (z j , b(u) - b(uj)) = a(z j , e,-) + (BjZ j , v) = g(ej). 

Here we used the integral mean value identity: 

b(u) — b(uj) = / b'(uj + (u — Uj)£) d£(u — Uj) = Bj(u — Uj). 
Jo 

The derivation and numerical use of the linearized dual problem is further discussed 

in [15, 14,21]. 

Unfortunately, the dual problem (2.3) is not practical because the operator Bj depends 

on the exact solution u. In order to define a computable dual operator, we introduce 

the approximate operator b'(uj), which lead to the following approximate dual problem: 

Find F G #o(fi) such that 

a(z j J v) + (b'(u j )z j ,v) =g(v), Vv e £#(fi). (2.5) 

The equation (2.5) is instrumental for defining a computable a posteriori error indicator 
for the dual problem. 

A further difficulty arises in the analysis of the goal-oriented adaptive algorithm driven 
by the a posteriori error estimators for the approximate dual problem (2.5). Due to the 
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dependence on Uj, (2.5) changes at each step of the adaptive algorithm. This is one of 
the essential differences of the nonlinear problem as compared to the linear cases in the 
previous literature (cf. [32, 23]). To handle this obstacle, we introduce the limiting dual 
problem: Find z E Hq (Cl) such that 

a(z, v) + (b'(u)z, v) = g(v), Vv G H^to). (2.6) 

While the operator b'(u) is a function of the exact solution u and is not a computable 
quantity, it is the operator used in the limit of both the linearized dual (2.3) and approx- 
imate dual problems (2.5) as Uj — > u. Therefore, both the linearized and approximate 
sequences approach the same limiting problem (2.6). Our contraction result in Theo- 
rem 5.9 is written with respect to the limiting dual problem as defined by the operator 
b'(u). 

2.2. Finite Element Approximation. For a given conforming, shape-regular triangula- 
tion T of Q consisting of close simplices T G T, we define the finite element space 

V r := H^Q) n J] P n (T) and V fc := V Tfc , (2.7) 

where P„(T) is the space of polynomials degree degree n over T. For any subset S C T, 

¥ T (S):=H l (Q)nl[F n (T). (2.8) 

Tes 

Given a triangulation T, we denote hj := max^er hr where hr '■= IT] 1 ^. In particu- 
lar, we denote ho : = hj- for an initial (conforming, shape-regular) triangulation % of f2. 
Then the adaptive algorithm discussed below generates a nested sequence of conforming 
refinements {71}, with Tk > Tj for k > j meaning that Tk is a conforming triangulation 
of Q based on certain refinements of Tj. With this notation, we also simply denote by 
Vfc := V-7- fc the finite element space defined on Tk- 

The finite element approximation of the primal problem (1 .2) reads: Find u k E Yk such that 

a{u k , v k ) + (b(u k ), v k ) = f(v k ), v k E V fc , (2.9) 

and the finite element approximation of (2.5) linearized about Uj is given by: Find z 3 k E 
Vfc such that 

a(zl, v k ) + (b'(uj)zl, v k ) = g(v k ) for all v k E V fc . (2.10) 

Finally, for the purpose of analysis, we require the discrete limiting dual problem (cf. 
(2.6)) given by: Find z k E Vfc such that 

a(z k ,v k ) + (b'(u)z k ,Vk) = g(v k ) forall^fc G Vfc. (2.11) 

Existence and uniqueness of solutions to the primal problems (1.2) and (2.9) follow 
from standard variational or fixed-point arguments as in [38] and [29]. For the dual 
problems (2.5)-(2.6) and (2.10)-(2.11) the existence and uniqueness of solutions follow 
from the standard Lax-Milgram Theorem as in [17], since we assumed that &'(£) > 0. 

We make the following assumption on the a priori L^ bounds of the solutions to the 
primal problems (1.2) and (2.9): 

Assumption 2.2 (A priori bounds). Let u and u k be the solution to (1.2) and (2. 9), respectively. 
We assume that there are u^,u + E L^ which satisfy 

U-(x) < u(x),u k (x) < u + (x) for almost every x E Q. (2.12) 
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Remark 2.3. The L^ bound on u follows from the standard maximum principle, as 
discussed in [3, Theorem 2.4] and [24, Theorem 2.3]. There has been a lot of literature 
on the L^ bounds on the discrete solution, usually with additional angle condition of the 
triangulation (cf [28, 26, 27, 24] and the references cited therein). On the other hand, 
ifb satisfies the (sub)critical growth condition, as stated in [3, Assumption (A4)], then 
the L^ bounds on the discrete solution Uk are satisfied without angle conditions on the 
mesh, see [3] for more details. 

Assumption 2.1 together with Assumption 2.2 yield the following properties on the 
continuous and discrete solutions as summarized below. 

Proposition 2.4. Let the problem data satisfy Assumption 2.1 and Assumption 2.2. The 
following properties hold: 

1) b is Lipschitz on [«_, u + ] fl H^ (fl) for a.e. x G fl with constant B. 

2) b' is Lipschitz on [«_, u + ] fl Hq (fl) /or a.e. x G fl with constant 6. 

3) Let z the solution to (2.6), zj the solution to (2.10) and Zj the solution to (2.11). 
Then there are z_,z + G Loo which satisfy 

Z-(x) < z(x),Zj(x),Zj(x) < z + (x) for almost every x G fl, j G N. (2.13) 

3. Goal Oriented AFEM 

In this section, we describe the goal oriented adaptive finite element method (GOAFEM), 
which is based on the standard AFEM algorithm: 

SOLVE ->- ESTIMATE ->• MARK ->• REFINE . 

Below, we explain each procedure. 

Procedure SOLVE. The procedure SOLVE involves solving (2.9) for Uj, computing 
b'(uj) to form problem (2.10) and solving (2.10) for z 3 -. In the analysis that follows, we 
assume for simplicity that the exact Galerkin solution is found on each mesh refinement. 
In practice the nonlinear problem (2.9) may be solved by a standard inexact Newton + 
multilevel algorithm as in [2]. The approximate dual problem (2.10) may be solved by 
any standard linear-time iterative method. 

Procedure ESTIMATE. We use a fairly standard residual-based element-wise error es- 
timator for both primal and approximate dual problems. Recall that the residual of the 
primal problem is given by R(v) = f — J\f(v) with J\f(v) = —V ■ (AVv) + b(v). 
For the limiting and approximate dual problems, we define the local strong form by 
£*(v) -.= -V-(AS7v)+V(u)(v), and £*{v) := -V ■ (AVv) + b\u j ){v). The limiting 
and approximate dual residuals given respectively by 

R*(v) : =g- C*(v), and R*(v) := g - C*(v). (3.1) 

The jump residual for both the primal and linearized dual problems is: 

J T (v) := l[AVv] ■ njar, 

where [ ■ J is given by [</>]#r : = hm^o <f)(x + tn) — <\>{x — tn)and n is taken to be the 
appropriate outward normal defined on dT. On boundary edges o h we have |[v4Vt>] ■ 
njff b = so that [[AVv] • n]#r — [[^4Vi>] ■ njQ Tn Q. The error indicator for the primal 
problem (2.9) is given by 

V 2 r (v,T) := h 2 T \\R(v)\\ 2 L2{T) + h T \\J T (v)\\l 2{dT) , v G V r . (3-2) 
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Similarly, the dual error-indicator is given by the approximate residual 

C 2 T /w,T) := h 2 T \\R*(w)\\l 2(T) + h T \\J T (w)\\l 2(dT) , w E V T . (3.3) 

This dual indicator is defined in terms of the approximate dual operator b'(uj) as this is a 
computable quantity given an an approximation Uj. In addition, for purpose of analysis 
we define the limiting dual error-indicator by 

&(w,T) := h 2 T \\R*(w)\\l 2(T) + h T \\J T (w)\\ 2 L2{dT) , w E V r . (3.4) 

We remark that the limiting dual indicator as given by (3.4) is not computable. For any 
given subset S C T, the error estimators on S are given by the 1% sum of error indicators 
over elements in the space. 

v 2 t (v,S):=J2vt(v,T), veY T - 
Tes 
The dual energy estimator is: 

res 
and the limiting estimator 

#(*>,$):= £)#(«>, T), weV r . 
res 
To simplify the notation, below we will omit "5" in the above definitions if S = T and 
we will use r} k to denote 777^., and similarly use (&,• to denote (%,■■ 

As in [8] it is not difficult to verify that the indicators for the primal and approximate 
(respectively limiting) dual problems satisfy the monotonicity property for v E V(7i) 
and 72 > 7i 

V2(v,%) < r]i{v,Ti), (2j(v,T 2 ) < Ci>,Ti) and C 2 (^,T 2 ) < Ci(v,7i). (3.5) 

For an element T E T2 fl 7i 

%(t;,T) = 771KT), C 2J (^,T) = Cij(u,T) and ( 2 (v,T) = ( x (v,T). (3.6) 

Procedure MARK. The Dorfler marking strategy for the goal-oriented problem is based 
on the following steps as in [32]: 

1) Given 9 E (0, 1), mark sets for each of the primal and dual problems: 

• Mark a set M. p C % such that 

rit{u k ,M p )>e 2 rJl{u k ,%). (3.7) 

• Mark a set Aid C Tk such that 

(t k (4,M d )>9 2 (l k (z k k ,%). (3.8) 

2) Let .M = Ai p U A^rf the union of sets found for the primal and dual problems 
respectively. 

As in [23] the set M. differs from that in [32], where the set of lesser cardinality between 
Aip and Aid is used. As seen in (3.8) the mesh is marked with respect to the dual indi- 
cators of the approximate-sequence solutions z\ as these are computable quantities. Sets 
Ai p and Aid with optimal cardinality (up to a factor of 2) can be chosen in linear time by 
binning the elements rather than performing a full sort [32]. 

Procedure REFINE. The refinement (including the completion) is performed according 
to newest vertex bisection which was first proposed in [36]. It has been proved that 



8 M. HOLST, S. POLLOCK, AND Y. ZHU 

the bisection procedure will preserve the shape-regularity of the initial triangulation %. 
The complexity and other properties of this procedure are now well- understood (see for 
example [6] and the references cited therein), and will simply be exploited here. 

4. Contraction for the primal problem 

Here we discuss the contraction of the primal problem (1 .2), recalling results from [25], 
[24] and [2]. The contraction argument relies on three main convergence results, namely 
quasi-orthogonality, error-estimator as upper bound on error and estimator reduction. We 
include the analogous results here for the limiting dual problem when they are identical 
or nearly identical. 

4.1. Quasi-orthogonality. Orthogonality in the energy-norm ||w — W2I 2 = \\u — Ui\\ 2 — 
\\u2 — Mi ||| 2 does not generally hold in the semilinear problem. We rely on the weaker 
quasi-orthogonality result to establish contraction of AFEM (GOAFEM). The proof of 
the quasi-orthogonality relies on the following L 2 -lifting property. 

Lemma 4.1 (L 2 -lifting). Let the problem data satisfy Assumption 2.1 and Assumption 2.2. 
Let u be the exact solution to (1.2), and U\ G Vi the Galerkin solution to (2.9). Let 
w G H 1+S (Q) n Hq(Q) for some < s < 1 be the solution to the dual problem: Find 
w G Hq(Q) such that 

a(w,v) + (B 1 w,v) = {u-U!,v), v e H^(Q), (4.1) 

where the operator B\ is defined by B\ := J Q b'{^u + (1 — £) M i) d£. As in [9, 16, 1] we 
assume the regularity 

Mfrn-(n) < k r\\ u ~ wi||La(n) ( 4 - 2 ) 

based on the continuity of the diffusion coefficients a^ and B\ G L 00 (fi) . Then 

\u — ui\\l2 < C*h S Q I u — u\\. (4.3) 

Proof. The proof follows the standard duality arguments in [1], [23] and [7], adapted for 
the semilinear problem. LetX^ : Hq(Q) — > Vi be a quasi- interpolator, satisfying 

\\w -l h w\\ H i <C x h s Ti \w\ H i+s (4.4) 

\\w - T h w\\ L2 < C x h}j{ s \w\ H i + s. (4.5) 

as discussed in [1], [37] and [23]. 

Consider the linearized dual problem (4.1) with v = u — u\ G Hq(CI) expressed in 
primal form 

a(u -U!,w) + (Bi(u -«i),w) = ||w - Ui\\ 2 L2 . (4.6) 

By Galerkin orthogonality, for X h w G Vi 

a(u - u u X h w) + (Bi(w - u x ),X h w) = 0. (4.7) 

Subtracting (4.7) from (4.6) 

a(u-ui,w -X h w) + (b(u) -b{ux),w-X h w) = ||«-wi||i a - (4.8) 

Then by (2.1) continuity of a( ■ , ■ ), the Holder inequality and Lipschitz continuity of b 
(Proposition 2.4): 

ll M — m iIIl 2 — ^sll^ — w-i || jyi \\w — X h w\\ H i + B\\u — ui\\i, 2 \\w — X h w\\l 2 . (4.9) 
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By coercivity (2.2), interpolation estimate (4.4), and regularity (4.2) on the first term on 
the RHS of (4.9) 

M £ \\u — ui\\ H i\\w — X h w\\ H \ < — Or/in I it — Wi||||if|fl- 1 + s 

m £ 

M £ 
< — K R C x h s \\u-ui \l\\u-Ui\\ L2 . (4.10) 

m £ 

For the second term of (4.9), apply (4.5) followed by (4.2) and coercivity to the interpo- 
lation error yielding 

B\\u — Ui\\l 2 \\w ~% w\\l 2 < BCxh\ +S \\u — ■Ui||l 2 M#i+ s 

< K R BC x hl +s \\u - u^l^u - Ui\\ L , 2 

< (m £ l K R BCxho)h s Q \\u - ui\\ L2 \u - ui\. (4.11) 
Applying (4.10) and (4.1 1) to (4.9), we obtain 

||u - «i|| La < rrq x K R (m £ C x + BC x ho) h s \lu - «i|||. (4.12) 

This completes the proof. D 

Remark 4.2. This proof yields the related result for Ui G Vj the solution to (2.9), i — 1, 2 

\\u2-ui\\ L , 2 < C*h,Q \\u2 — Ux\\. (4.13) 

The proof of (4.13) follows by replacing u by u 2 in 4.1. In particular, the dual prob- 
lem (4.1) is replaced by: Find w G V 2 such that 

a(w,v) + {B 12 w,v} — (« 2 — u\,v), v G V 2 , (4.14) 

where the operator B\ 2 '■= f b\^u 2 + (1 — £) M i) d£,. 

Remark 4.3. As the dual problem (4.1) changes at each iteration, so may the regular- 
ity constant as given by (4.2) as well as the interpolation constants as given by (4.4) 
and (4.5). As such, the previous lemma shows a C* t k for k = 1,2,.... As the algo- 
rithm is run finitely many times, we consolidate these C* t k into a single constant C*for 
simplicity of presentation. 

Now we are in position to show the quasi-orthogonality. 

Lemma 4.4 (Quasi-orthogonality). Let the problem data satisfy Assumption 2. 1 and As- 
sumption 2.2. Let 71, T 2 be two conforming triangulation of Q with Ti > T\. Let 
u G Hq(Q) the exact solution to (1.2), Ui G V« the solution to (2.9), i = 1,2. There 
exists a constant C* > depending on the problem data D and initial mesh %, and a 
number < s < 1 related to the angles ofdVL, such that if the meshsize ho of the initial 
mesh satisfies A := Bm £ 1 C^hQ < 1, then 

\\u - u 2 \l 2 < Afu - v\\ 2 - \\u 2 - v\\ 2 , VwGV 2 , (4.15) 

and in particular for v = U\ G Vi C V 2 

|||w — u 2 \ 2 < Ag\\u — iiif 2 — \\u 2 — M i|| 2 ) (4-16) 

where 

A := (1 - Bm^ChQ- 1 and A G := (1 - ^C^ 8 )" 1 

and C* is the constant from Lemma 4.1. 
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Proof. For any given v G V 2 , we have 

\l u ~ u 2\\ 2 = \\u — v\\ 2 — \\v — u 2 \\ 2 + 2a{u — u 2 ,v — u 2 ). (4.17) 

By Galerkin orthogonality 

a(u - u 2 , v) + (b(u) - b(u 2 ), v ) = for all v G V 2 , (4.18) 

and taking v = v — u 2 in (4.18), we have 

2a(u - u 2 , v - u 2 ) < 2\{b(u) - b(u 2 ), v — u 2 )\ 

<2B\\u-u 2 \\ L2 \\v~u 2 \\ L2 . (4.19) 

Here we used Holder inequality and the Lipschitz property on b (cf. Proposition 2.4). 

In the case of (4.15) applying L 2 -lifting Lemma 4.1 to the first factor on the RHS 
and (2.2) coercivity to the second followed by Young's inequality, we obtain 

2B\\u — u 2 \\l 2 \\v — u 2 \\ L2 < 2Bm^ 1 C^hQl\u — u 2 l\\\v — -u 2 | 

< Bm^C^Ju - u 2 f + Bm^C^Jv - u 2 f . (4.20) 

Applying (4.20) via (4.19) to (4.17) 

(1 - Brrq x C*hl)\\u - u 2 f < \\u - vf - (1 - Bm^C^fv - u 2 f . 

Assuming A := Bvii^CJiq < 1, we have 

||w - u 2 f < k\\u - vf - \\v - u 2 f (4.21) 

with A = (1 - Bm^-CJ^)- 1 . 

In the case of (4.16) applying L 2 -lifting 4.1 to each norm on the RHS of (4.19) by 
means of Remark 4.2 then applying Young's inequality 

2B\\u - u 2 \\ L , 2 \\ui - u 2 \\l 2 < 2Bh 2 Q S Cl\u - u 2 \\u\ - u 2 \\\ 

< Bhl s Cl\\u - u 2 f + BC 2 h 2 s l\ Ul - u 2 f. (4.22) 

Following the same procedure as above yields 

I'M — M 2|| 2 < Ag|w — wij 2 — \u\ — u 2 \ 2 (4.23) 

with K G — (1 — BC 2 ^ 3 )" 1 with the weaker mesh assumption A G := BC 2 hl s < 1. □ 

We note that the second Galerkin orthogonality estimate (4.23) sharpens our results 
but is not essential to establishing them. 

4.2. Error Estimator as Global Upper-bound. The second key result for the contrac- 
tion of the primal problem is the error estimator as a global upper bound on the energy 
error, up to a global constant. The result for the semilinear problem is established in 
[25] and [2] with a clear generalization to the approximate dual sequence, also see [8] 
and [31] for the linear cases. The proof of this result follows from the general a posteriori 
error estimation framework developed in [39, 40]. 

Lemma 4.5 (Error estimator as global upper-bound). Let the problem data satisfy As- 
sumption 2.1 and Assumption 2.2. Let Tk be a conforming refinement of%. Let u G 
Hq{Q) and u^ G V& be the solutions to (1.2) and (2.9), respectively. Similarly, let 
z G Hq(JI) and Zk G V& be the solutions to (2.6) and (2.1 1), respectively. Then there is a 
global constant C\ depending only on the problem data D and initial mesh % such that 

ju-Ukl < CiVk(uk) (4.24) 

and 

p - 41 < CiCfc(ifc). (4.25) 
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4.3. Estimator Reduction. The local Lipschitz property as in [25], analogous to the 
local perturbation property established in [8], is a key step in establishing estimator re- 
duction leading to the contraction result. For any T G T, we denote 

UT := T U {T 1 G T | T n T' is a true-hyperface of T}. (4.26) 

Here, for a <i-simplex T, a true-hyperface is a d — 1 sub-simplex of T, e.g., a face in 3D 
or an edge in 2D. We also define the data estimator on each element TeTas 

rfrD, T) = h% (||divA||L (T) + h- T 2 \\A\\l x{uJT) + B 2 ) , (4.27) 

and denote t]t(D,S) = max*r Gl s?77-(D,T) for any subset S C T. In particular, we 
denote by ?7 := 7777, (D, 7o) the data estimator on the initial mesh. As the grid is refined, 
the data estimator satisfies the monotonicity property for refinements T2 > 7i (cf. [8]): 

r/ 2 (D,r 2 )<r /l (D,ri). (4.28) 

Lemma 4.6 (Local Lipschitz Property). Let the problem data satisfy Assumption 2.1 and 
Assumption 2.2. Let T be a conforming refinement of%. Then for all T G T and for 

any v, w G V7- 

\r ]r (v,T)-r lr (w,T)\<A 1 r lr (B,T)\\v-w\\ HHu}T) . (4.29) 

The constant Ai > depends on the dimension d and the initial mesh To- 
Proof The proof follows those in [8] and [23], and we sketch the proof below. From (3.2) 
r, 2 T (v,T) := h 2 T \\R(v)\\ 2 L2iT) + h T \\J T (v)\\l {dT) , v G V r . (4.30) 

Set e = v — w and by definition of the residual, we get 
R(v) =f-Af(w + e) 

= f + V • (AVw) - b{w) + V • (AVe) - I b'{w + ^e) d£e 

Jo 

= R(w)+T>(e), 

where T>(e) := V ■ (AVe) — f Q b'(w + ^e) d£e. Using the generalized triangle-inequality 

V '(a + b) 2 + (c + d) 2 < v^Tc 2 + 6 + d, for a, 6, c, d > 
and linearity of the jump residual we have 



Vr (v,T) = (h 2 T \\R(w) + V(e)\\ 2 L2{T) + h T \\J{w) + J( ( 



\L 2 (dT) 



1/2 



< V r(w,T) + h T \\V{e)\\ L2{T) + h l ^\\J{e)\\ L2m . 
For the second term of (4.31), by triangle inequality we obtain 



|P(e)|k(T)<||V-(AVe)|U 2(T) + 



b'(w + ^e) d^e 







(4.31) 



(4.32) 



I*(T) 



By the inverse inequality, the diffusion term satisfies the bound 

|| V • (AVe)|| L2(T) < ||divA • Ve|| L2(T) + \\A : D 2 e\ 



\L 2 (T) 

< (\\&vA\\ Loo{T) + dh^WAW^T)) ||Ve|| L2(T) , 

where D 2 e is the Hessian of e. The second term in (4.32) is bounded by 

-1 

b' (w + £e) d£e < B\\e\\ L2 ( T )- 

Li(T) 



(4.33) 



(4.34) 
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The jump term in (4.31) satisfies 

\\J(e)\\ L2{ 8T)<2(d+l)C T h T 1/2 \\A\\ LooM \\Ve\\ L2{ulT) 

— 1/2 ii 

= Cjh T ||^||L 00 (w T )||Ve|| L2 ( WT ), (4.35) 

where Ct depends only on the shape-regularity of the triangulation. Putting together (4.31), 
(4.33), (4.34) and (4.35), we obtain 

Vt{v, T) < rj T {w, T) + h T (||divA|| Loc(r) + (C, + Cj)h^\\A\\ Lao{UT) + B) \\e\\ m{uJT) 

<Vt(w,T) + C TO tVt(D,T)\\v-w\\ h i {uJt) . (4.36) 

This completes the proof. D 

The local perturbation property as demonstrated in Lemma 4.6 (respectively, Lemma 5.4 
below) leads to estimator reduction, one of the three key ingredients for contraction of 
the both the primal and combined quasi-errors. This result holds for both the primal and 
limiting dual problems, whose proof can be found in [8, Corollary 2.4] or [23, Theorem 
3.4]. 

Theorem 4.7 (Estimator reduction). Let the problem data satisfy Assumption 2.1 and 
Assumption 2.2. Let 7i be a conforming refinements of To, M. C 7i be the marked set, 
andT 2 =REFINE{T u M). Let 

Ai := (d + 2)Alm £ 2 and A := 1 - 2~ 1/d > 

with Ai from Lemma 4.6 (local Lipschitz property). Then for any v\ G Vi and v% G V2 
and 5 > 

Vl(v2, T 2 ) <(l + S) { V 2 (v h 71) - Xvl(vi, M)} + (1 + r^A^l^ - Vl f. (4.37) 

Analogously for the limiting dual problem 

C 2 2 (^,T 2 ) <(1 + 6) (Ci>i,Ti) - \&(vi,M)} + (1 + r^A^I^ - v4 2 . (4.38) 

The contraction of the primal (semilinear) problem is established in [25] and [22] 
based on Lemma 4.4, Lemma 4.5 and Theorem 4.7 as discussed above. 

Theorem 4.8 (Contraction of the primal problem). Let the problem data satisfy As- 
sumption 2.1 and Assumption 2.2. Let u the solution to (1.2). Let 9 G (0, 1], and let 
{75> Y?5 M j}j>o b e tne sequence of meshes, finite element spaces and discrete solutions 
produced by GOAFEM. Then there exist constants 7 P > and < a < 1, depending on 
the initial mesh % and marking parameter 9 such that 

\\\u - u j+1 \\ 2 + i P r] 2 +l < a 2 (\\u - Ujf + ~f p r} 2 ) . (4.39) 

5. Contraction and Convergence of GOAFEM 

In this section, we discuss the contraction and convergence of the GOAFEM de- 
scribed in §3. In particular, we show that the GOAFEM algorithm generates a sequence 
{Tj, Vj, Uj, Zj}j>o which contracts not only in the primal error as shown in §4, but also 
in a linear combination of the primal and limiting dual error. We emphasize that it would 
be difficult to derive convergence results in terms of problem (2.3) or (2.5), because at 
each refinement the problem is changing. So we show contraction in terms of the error 
in the limiting dual problem (2.6) as the target equation is fixed over the entire adap- 
tive algorithm. Our approach to contraction in this section again relies on three main 
components: quasi-orthogonality, error-estimator as upper bound on error and estimator 
reduction. Here we discuss the relevant results for the limiting dual problem with an 
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emphasis on those that differ significantly from the corresponding results for the primal 
problem. Note the limiting dual problem is not computable. We connect the error for 
the limiting dual problem to the computable quantities in the GOAFEM algorithm. For 
this purpose, we introduce Lemma 5.7, converting between limiting and approximate es- 
timators in order to apply the Dorfler property to a computable quantity; and Lemma 5.8, 
bounding the discrete error between approximate and limiting dual solutions in terms of 
the primal error. We put these results together in Theorem 5.9 to establish the contraction 
of the combined quasi-error. Finally, the contraction of this form of the error is related 
to the error in the quantity of interest in Corollary 5.10. 

5.1. Quasi-orthogonality for Limiting-dual Problem. Similar to the proof of the quasi- 
orthogonality for the primal problem, we make use of an L 2 -lifting argument for the 
limiting-dual problem. Let z G Hq(CI) and Z\ G Vi be the solutions to (2.6) and (2.11), 
respectively. We again use the duality argument, and introduce the problem: Find 
y G Hq(CI) such that 

a{y,v) + (b'(u)y,v) = (z - z l7 v) for allu G H^(Q) (5.1) 

Then we have the following L 2 -lifting result for the limiting-dual problem. 

Lemma 5.1 (Limiting-dual L 2 -lifting). Let the problem data satisfy Assumption 2.1 and 
Assumption 2.2. Let T\ be a conforming triangulation, and z G Hq(TL) and i\ G Vi be 
the solutions to (2.6) and (2.1 1), respectively. Assume that the solution y to (5.1) belongs 
to H 1+S (£l) n Hl(Vt) for some < s < 1 such that 

\y\m+s{n) < K R \\z - zxWl^u)- (5-2) 

Then 

II z — zi\\ < CJiq\z — z\\. (5.3) 

Proof. The proof follows that of Lemma 4.1. As in (4.12), we obtain for the limiting 
dual estimate (5.3) 

\\z - h\\ L2 < m- e l K R (m £ C x + BC x hv) h s \\\z - ^|||. (5.4) 

D 

Remark 5.2. To apply L 2 -lifting to the difference between two Galerkin solutions, z 2 G 
V 2 and z\ G V\ where % > Ti, we use the same proof with (5.1) replaced by the prob- 
lem: Find y G V 2 such that 

a(y,v) + (b'(u)y,v) = (z 2 — z\,v) for allv G V 2 . (5.5) 

With the help of Lemma 5.1, we obtain the quasi-orthogonality for the limiting-dual 
problem. 

Lemma 5.3 (Quasi-orthogonality for Limiting Dual Problem). Let the problem data sat- 
isfy Assumption 2.1, and T\,T 2 be two conforming triangulations with T 2 > T\. Let 
z G Hq(Q) the solution to (2.6) and 2i G Vj the solution to (2.11), % = 1, 2. There exists 
a constant C* > depending on the problem data D and initial mesh %, and a number 
< s < 1 related to the regularity of (5 . 1 ), such that for sufficiently small h we have 

p-£ 2 |I 2 <Ap-£|| 2 -p2 -vf, VwgV 2 , (5.6) 

and in particular for v — Z\ 

\\z - z 2 f < A G p - hf - p 2 - z x I 2 (5.7) 
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where 

A := (1 - Brrq l C*hl)- 1 and k G := (1 - BC^) -1 

and C% is the constant from. Lemma 5.1. 

Proof. The proof follows same arguments as in Lemma 4.4, except that in place of the 
inequality in (4.18) we have for the limiting dual problem 

a{u-u 2 ,v) + (b'(u)(z-z 2 ),v) = for all v G V 2 , (5.8) 

yielding 

2a(z - z 2 ,v - z 2 ) < 2B\\z- z 2 \\ L2 \\v - z 2 \\ L2 , (5.9) 

as in (4.19). The rest of the proof is similar to Lemma 4.4, and we omit it here. □ 

5.2. Estimator Perturbations for Dual Sequence. As we have seen in Theorem 4.7, 
the local Lipschitz property (cf. Lemma 4.6) plays a key role in deriving the estimator 
reduction property used to convert between estimators on different refinement levels in 
both the primal and limiting dual problems. The following lemma gives similar local Lip- 
schitz properties for the approximate and limiting dual problems on a given refinement 
level. 

Lemma 5.4 (Local Lipschitz Property for Dual Estimators). Let the problem data satisfy 
Assumption 2.1 and Assumption 2.2. Let T be a conforming refinement of To- Then for 
allT G T and for any v, w G V7-, it holds that 

\C T , j {^ T )-Cr, J {w,T)\<A 1 r ] r(D,T)\\v-w\\ HHu}T) . (5.10) 

In particular, for the error indicator of the limiting dual problem we have 

\(t(v,T)-(t(w,T)\<A iV t(D,T)\\v-w\\ h i (ut) . (5.11) 

The constant Ai > depends on the dimension d and the regularity of the initial mesh 

To. 

Proof. The proof follows those in [8], [23] and is nearly identical to Lemma 4.6. It is 
sketched here. To prove (5.10), by (3.3) we have 

&j{v,T) := h 2 T \\R*(v)\\ 2 L2{T) + h T \\J T (v)\\l 2{dT) , v G V r . (5.12) 

Setting e = v — w and applying linearity to the definition of the dual residual as given 
by (3.1), we obtain 

R*{v) = g + C*(w + e) = R*(w) + C*(e). 

By the same reasoning as (4.31), we get 

CtAv,T) < CtjKT) + h T \\q(e)\\ L2{T) + 4 /2 || J(e)\\ L2{dT) . (5.13) 

The term C* (respectively C* for the limiting dual) in (5.13) satisfies the same bound as 
the analogous term V in (4.31) of Lemma 4.6. Hence the bounds (5.10) and (5.11) hold 
with the same constants as in (4.29). □ 

With the help of Lemma 5.4, we are able to derive the following corollary, which 
addresses the error induced by switching between error indicators corresponding to the 
approximate and limiting dual problems on a given element. 
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Corollary 5.5. Let the problem data satisfy Assumption 2.1 and Assumption 2.2. Let T 
be a conforming refinement of%, and u, Uj are the solutions to (1 .2) and (2.9) problems, 
respectively. Let and K z the constants given in Proposition 2.4. For all T G T and 
for v, w G Vr H [z-, z+] the dual indicator on T satisfies 

\Cr,j(v,T) -( Tjk (w,T)\ < Aijyr(D,T)||u - Hl^teO +0-KzM|uj -«*llia(T)- 

(5.14) 
In particular, for T = 71, we have for the limiting estimator 

\Ci,i(v,T)-Ci(w,T)\<A 1 r ]l (B,T)\\v-w\\ HHuJT) + eK z h T \\u~u 1 \\ L2{Th (5.15) 
\Ci(w,T)-Ci,i(v,T)\<A 1 r ll (B,T)\\v-w\\ H i iu;T) + eK z h T \\u-u 1 \\ L2{T) . (5.16) 

Proof. By the definition of the residuals for the approximate dual problems, for any 
w G Vr we have 

R*(w) = g + V • (AVw) + b'(u k )w + (fe'^) - ^(u*)) w 

= R* k (w) + {b'iuj) - b'(u k )) w. (5.17) 

Using (5.17) in the definition of the dual indicator (3.3) and applying a generalized tri- 
angle inequality 

Cr,>,T) = (h 2 T \\R* k (w) + (b'( Uj ) - b\u k ))w\\l 2{T) + h T \\J T (w)\\l 2{dT) y /2 

< \h 2 T \\R* k (w)\\l 2{T) + h T \\J T (w)\\ 2 L2idT) J + hr\\b'{uj) - b\u k )w\\ L2{T) 

< ( r ,k( w > T ) + <3>K z h T \\u j -u k \\ L2iT ). (5.18) 
Applying (5.10) in Lemma 5.4 to the estimate (5.18), we obtain (5.14). □ 

As an immediate consequence of Corollary 5.5, we have the following results on the 
error induced by switching between dual estimators over a collection of elements on a 
given refinement level. This estimate plays a key role in the contraction argument below, 
as we apply it to switching between the estimator for the limiting dual and the computed 
error estimators for the approximate dual problems in the GOAFEM algorithm. 

Corollary 5.6. Let the hypotheses of Corollary 5.5 hold. Then for any subsets M.\, M. 2 C 
7i and arbitrary Si, 8%, 6a, 8b > 

CKv,M 1 )>(l + 5 1 )- 1 (l + 5 A r 1 d 1 (w,M 1 ) 

- (1 + SJ-^&KlhlWu - Ul \\l 2 -(d + 2)5^K\r,l\\v - wf m 

(5.19) 

Cli(w,M 2 ) > (l + 5 2 )-\l + 5 B )- 1 ( 2 1 (v,M 2 ) 

- (1 + 5 2 )- 1 5 B 1 Q 2 K 2 z h 2 \\u - Ul \\l 2 -(d + 2)5^Klr,l\\v - w\\ 2 m . 

(5.20) 

Proof. The conclusions follow by squaring inequality (5.15) (respectively (5.16)), ap- 
plying Young's inequality twice, and then summing over element T G M.\ (respectively 
T G M. 2 ). The H 1 norm is summed over all elements T G 7i counting each element 
d + 2 times, the maximum number of elements in each patch oj t . D 
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5.3. Contraction of GOAFEM. The main contraction argument Theorem 5.9 follows 
after two more lemmas. The first combines a sequence of estimates to convert the non- 
computable limiting estimator for the dual problem to a computable quantity, apply the 
Dorfler property and then convert back. The second relates the difference between the 
Galerkin solutions of the limiting and approximate dual problems to the primal error. 
Motivated by estimator reduction for the limiting dual problem as in equation (4.38) 

C 2 2 (^2,r 2 ) <(1 + 6) (Ci02i,Ti) - AC?(ii, M)} + (1 + S- l )Atr) 2 Q p2 - hf (5.21) 

the following lemma addresses the conversion between the limiting estimator (f (z 1( M.) 
and and the computable estimator Cii(^i) M) necessary for marking the mesh for re- 
finement. 

Lemma 5.7. Let the problem data satisfy Assumption 2.1 and Assumption 2.2. Let 
6 and K z as given by Proposition 2.4, C* as given by Lemma 4.1 and A x as given 
in Lemma 4. 7. Let 

u the solution to (1.2), u\ the solution to (2.9), 

z the solution to (2.6), Z\ the solution to (2.11) z\ the solution to (2.10). 

Let d y i(z{, M) satisfy the Dorfler property for M C 7i, namely (^±(z{ , M) > 6 ,2 Ci,i(^ij 7i)- 
Then for arbitrary Si,82,S a ,Sb > there is a 64 > such that 






II 2 



e 2 1 \ q 2 k 2 c 2 X [1+sj 2 

1 — ± * y \\\u — 7/1 

1 ;i + Sa)(1 + h)S B 6 a) (1+h) ' " 

+ ( uto + 9 A "'»( D - r »»" s ' - ^ 2 - <« 2 > 

Proof. From Corollary 5.6, L 2 -lifting 4.1 and coercivity (2.2) 

-C 2 i(zi,M) < -(l + S 1 )- 1 (l + 6 A )- 1 d 1 (zl,M) 

+ (1 + 5 1 )- 1 5 A 1 Q 2 K 2 z h 2 Q \\u - Ul \\l 2 + a^Aftd + 2) % 2 p 1 - ^H^ 
<-(l + ( 5 1 )- 1 (l + ^)- 1 C 1 2 1 (5 1 1 ,>l) 

+ (1 + s 1 )-^2 1 e 2 K' z cX il+a) \h - ^ill 2 + ^AitjgPi - %T 

(5.23) 

with Ai := Af (d + 2)m~f 2 . The Dorfler property may be applied to the first term on the 
RHSof(5.23) 

-Cl,i{zlM)<-e 2 ClM)- (5-24) 

Converting back to he limiting estimator by (5.20) in Corollary 5.6 

-Ci 2 ,i(^i) < -(1 + ^ 2 )- 1 (l + 6 B )-\ 2 (z l ,M) 

+ (1 + 5 2 )- l 5 B l <d 2 K 2 z Clhf +s) \\u - u x f + c^A^pi - z\f. (5.25) 

Define <5 4 by 

(1 + 8 4 ) := (1 + 50(1 + 5 2 )(l + 6 A )(l + 5 B ). (5.26) 
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Then by plugging (5.24) and (5.25) in the first term on the RHS of (5.23), we obtain 

-CK^M)<-e 2 (i + s i )- 1 d(z 1 ) 

+ {6 2 (1 + 5 A )- 1 (1 + 5 2 )- 1 5 B 1 + SJ 1 ) (1 + 5 1 )- 1 Q 2 K 2 Z CX {1+S) h ~ «i| 
+ (9 2 (1 + S 1 )- 1 (l + Sa)-^, 1 + 5^) A 1 r, 2 \\z 1 - z\f. (5.27) 

Finally, we split the first term on the RHS of (5.27) into two pieces for some (3 E (0, 1), 
and apply the upper-bound estimate (4.25) in Lemma 4.5 to the second piece yielding 

-(Kh, M) < -f36 2 (l + «5 4 )- 1 Ci(5i) - (1 - (3)6 2 (1 + 5,)- l C^ 2 \\z - hf 

+ {e 2 (i + s A )-\i + 5 2 y 1 5 B 1 + #) (i + 8 1 )- 1 e 2 K 2 cX {1+s) \\u - «i|| a 

+ {6 2 {l + 6 1 )-\l + 5 A )- 1 5 2 1 + 5^) A lV 2 l\ Zl - z\f. 
This completes the proof. □ 

We may convert \z\ — z\\ in the last term on the RHS of (5.22) to the error ||w — u\\ 
as stated in the following lemma. 

Lemma 5.8. Let the problem data satisfy Assumption 2.1 and Assumption 2.2. Let 
6 and Kz the constants given in Proposition 2.4 and C* and C* the constants given 
by Lemmas 4.1 and 5.1, respectively. Let 

u the solution to (1.2), U\ the solution to (2.9), 

Z\ the solution to (2.1 1), z\ the solution to (2.10). 

Then 

Pi - ^ill < eK z C*C*h 2 s \\u - wi|||. (5.28) 

Proof. Recall that 

Z\ solves a(ii, v) + (b'(u)z\, v) = g(v), for all v E Vi, (5.29) 

z\ solves a(z{, v) + {b'{u x )z\, v) = g(v), for all v E Vi. (5.30) 

Subtracting (5.30) from (5.29) and rearranging terms, we get 

a{h - zl,v) + ((b'(u) - b'{u x ))z u v) = (b'{u x ){z{ - z x ),v), v E Vi. (5.31) 

In particular, for v = Z\ — z\ E Vi equation (5.31) yields 

Pi - ^ill 2 = -((b'(u) - b'(u 1 ))z 1 ,z l - z\) - (VMizi - z{),zi - z\) 

<-<(6'(«)-6'(tti))ii,i 1 -^> (5.32) 

where in the last inequality, we used the monotonicity assumption of b in Assump- 
tion (2.1). Now applying the Lipschitz property of V ' , the a priori L^ bounds on the 
dual solution z\ (cf. Proposition 2.4), and both primal and dual L 2 lifting in (5.32), we 
obtain 

pi - ^i||| 2 < ©^z||w -mi||l 2 Pi - ^i || i, 2 

< OK z C*C*hl s \iu - wilpi - zll\ (5.33) 

from which the result follows. □ 

Now we are in position to show the contraction of GOAFEM in terms of the combined 
quasi-error which is a linear combination of the energy errors and error estimators in 
primal and limiting dual problems. 
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Theorem 5.9 (Contraction of GOAFEM). Let the problem data satisfy Assumption 2.1 
and Assumption 2.2. Let 

u the solution to (1.2), Uj the solution to (2.9), 

z the solution to (2.6), Zj the solution to (2.11). 

Let 9 G (0, 1], and let {Tj, ^j}j>o be the sequence of meshes and finite element spaces 
produced by GOAFEM. Let 7 P > as given by Theorem 4.8. Then for sufficient small 
mesh size h , there exist constants 7 > 0, n > and an E (0, 1) such that 

P - h I 2 + l(l{h) + 7r||w - W2I 2 + irj p r)l(u 2 ) 

< a 2 D (\\z - z x f + 7Ci(^i) + tt||u - Mil 2 + 7T7 P 77?(ui)) . (5.34) 

Proof. For simplicity, we denote r] = r/ (D, %) and ( k (z k ) = ( k (z k , %), k = 1,2. By 
the estimator reduction for the limiting dual problem (4.38), for arbitrary 5 > we have 

Cl{h) <(1 + S) {C?(ii) - ACi(*i, -M)} + (1 + r^Ax^ps - 5J 2 , (5.35) 

where A = 1 — 2~ 1 / d . Recall the quasi-orthogonality estimate in the limiting dual problem 
from Lemma 5.3 

III A A III 2 ^- A III A "■ III 2 III A ^ III 2 /c 1 /^\ 

If --22 1 < A G \\z - zi\\ - \lz 2 - zi\\ . (5.36) 

Adding (5.36) to a positive multiple 7 (to be determined) of (5.35) and applying the 
results of Lemmas 5.7 and 5.8 obtain 

P - -Ml 2 + 7C2 (^2) < -4p - ill 2 + 7^Ci(zi) + £>|u - wil 2 

+ (7(1 + S-^Atvl - 1) p a - zip- (5-37) 
We first set 7 := (1 + 8~ 1 )~ 1 A^ 1 r] ~ 2 to eliminate the last term in (5.37). This yields 

p - -MI 2 + lCl(z 2 ) < A\\z - Zl f + 7MCi(ii) + D\\u - wi| 2 , (5-38) 
where the coefficients A and M of (5.38) are given by 

A = A G - (1 - /3)A0 2 <J(1 + «5 4 )- 1 C7r 2 Ar 1 % 2 (5.39) 

M = (1 + <J)(1-/3A0 2 (1 + <$ 4 ) _1 ) (5.40) 

where 5 4 satisfies (1 + 64) := (1 + <5i)(l + <5 2 )(1 + $a) (1 + £b) as was given in (5.26). 
For contraction, we require A < 1 and M < 1 for the coefficients defined by (5.39) 
and (5.40), that is, we need to choose a /3 E (0, 1) such that 

6 1 + 64 <P<l- {kG ~ 1)Acl -4^, (5.41) 



1 + 5 \9 2 ' 5 \9 2 

with Ac := C 2 Ai7y 2 . To demonstrate the existence of such a )3, set 



6 4 = 5 = ^\9 2 . (5.42) 

Then we require the mesh size ho sufficiently small, such that 

^ <l+ 2(2 + xl)Ac ' <5 ' 43) 

for the given 9 G (0, 1). Note the conditions (5.42) and (5.43) guarantee that the interval 
in (5.41) is nonempty, so there exists a (3 such that 

I^l- 'k-QAc / _2_ 

2 ' \ei \ \e 2 
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It remains to control the last term in (5.38). For simplicity, we assume 8\ = 5 2 = 5 a = 
5 b —■ 5c- Then the coefficient D in (5.38) is given by 

d - ««•**** C^w) Grow + ew ) • (5 - 44) 

To control the primal error term with the coefficient D as given by (5.44), we add a posi- 
tive multiple 77 (to be determined) of the primal contraction result (4.39) of Theorem 4.8 
to (5.39) yieding 

p - z 2 \ 2 + 7(2(^2) + 7t||m - u 2 \\ 2 + 7r-f p r]l(u 2 ) 

< A\\z - zil 2 + iMQl{h) + (D + a 2 Ti)\\u - uxf + aV^^i). (5.45) 

We choose n to ensure D + a 2 ir < n, namely, 

vr > — — 2 (5.46) 

1 — ot z 

and set 

a 2 D := max < A, M, ,a 2 <1. (5.47) 

Then the combined quasi-error satisfies the contraction property (5.34). □ 

For simplicity, we denote by 

Q 2 (uj,Zj) = p — Zj\\ 2 + 7C|(%) + vt||m - Uj\\ 2 + n'y p r)'j(uj) 

the combined quasi-error in (5.34). The following corollary gives the contraction of 
the error in the goal function, which is determined by the contraction of the combined 
quasi-error. 

Corollary 5.10. Let the assumptions in Theorem 5.9 hold. Then the error in the goal 
function is controlled by a constant multiple of the square of the combined quasi-error, 
i.e., 

\g(u)-g( Uj )\ < CQ 2 (u„ Zl ) < a 2 SCQ 2 (uo,zo). (5.48) 

Proof. Choosing the test function v = u — Uj in (2.6), and by linearity and Galerkin 
orthogonality for the primal problem, we obtain 

g(u) — g(uj) = a(z, u) + (b'(u)z, u) — a(z, Uj) — (b'(u)z, Uj) 

= a(u — Uj, z) + (b'(u)(u — Uj), z) 

= a(u — Uj, z) + (Bj(u — Uj), z) + ({b'(u) — Bj)(u — Uj), z) 

= a{u — Uj, z — Zj) + (b(u) — b{uj), z — zf) + ((&'(«) — Bj)(u — Uj), z). 

(5.49) 

The third term in the last line of (5.49) represents the error induced by switching from 
(2.6) to (2.3). This term may be bounded in terms of the constants and L^ estimates in 
Proposition 2.4 and 

-1 
b'(u) - b' (uj + £(u - Uj)) d£ 



\\b'(u)-B 3 \ 
yielding 






({b'(u) -Bj){u -Uj),z) < K z \\b'{u) - Bj\\ L2 \\u - Uj\\ L 

1 
2 



< ^6K z \\u - Uj \\l 2 . (5.50) 
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Then by (5.49), (5.50), the Cauchy-Schwarz inequality and L 2 -lifting as in Lemmas 4.1 
and 5.1 



1 



\g{u) - g(uj)\ < \\u — Uj\l\\z- %||| + B\\u - Uj\\ L2 \\z - %|| L2 + -QK z \\u 



u 



j\\L 2 



< (1 + BC*cx s )h - «illP - %ll + ^®K z Cihl s \\u - u 



< -[l + {QK z C*+BC*)C*h$ 



\\u — u 



•J\ 



[1 + BC*C*h, 



2s\ 
* w *"'0 J 



Z - Zi 



(5.51) 



Therefore the error in the goal function is bounded above by a constant multiple of the 
square of the combined quasi-error Q 2 (iij, Zj). Thus (5.48) follows by the contraction 
result in Theorem 5.9. □ 

6. Numerical Experiments 

In this section, we present some numerical experiments implemented using FETK [20], 
which is a fairly standard set of finite element modeling libraries for approximating the 
solutions to systems of nonlinear elliptic and parabolic equations. In these experiments, 
we consider Vt = [0, l] 2 and try to solve the following model problem: 

J\f(u) := -Au + 3u 3 = f, (6.1) 

with homogeneous Dirichlet boundary condition. Here the source function / is chosen 
such that the exact solution is given by 

= sin(7rx)sin(7ry) 

1 ' y > 2{x - 0.5) 2 + 2{y - 0.5) 2 + lO" 3 ' 

We solve the nonlinear problem using both the AFEM and the GOAFEM algorithms. 
Figure 1 shows a typical adaptive mesh as well as the solution of this semilinear equation. 
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Finite element solution 




Figure 1 . The mesh and finite element solution to the model problem 
after 12 GOAFEM iterations. 



In the adaptive algorithms, we use the Dorfler marking strategies: (3.7) for the primal 
problem and (3.8) for the dual problem, with the same parameter 9 = 0.4. For the primal 
nonlinear problem (6.1), at each refinement we use a Newton-type iteration to solve the 
resulting nonlinear system of algebraic equations, which reduces the nonlinear residual 
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Error Reduction of lg(u-Ujl 




Figure 2. The reduction rate in the goal error \g(u — Uu) \ . 

to the tolerance ||F(w)||x, 2 < 1CT 7 . On the initial triangulation, we use a zero initial 
guess for the Newton iteration; then for each subsequent refinement, we interpolate the 
numerical solution from the previous step to the current triangulation and then use it as 
the initial guess for the Newton iteration. By doing this, we have a good initial guess 
for the Newton iteration so that one could expect a quadratic convergence rate of the 
nonlinear iterations. In fact, according to our numerical experiments, it usually takes 4 
or 5 Newton iterations to reach the setting tolerance. 

To test the performance of the GOAFEM algorithm, we take the goal function 

-100((a:-0.5) 2 +(2/-0.5) 2 ) 



g = lOOe" 



(6.2) 



so that it captures the singularity of the solution to (6.1). Thus the error in goal functional 
\g(u — Uh)\ gives us an error by weighted average. Figure 2 shows the reduction rate of 
the goal error \g(u — Uu)\ for the GOAFEM algorithm. The oscillation at the first a few 
iterations in Figure 2 reflects the fact that the mesh size is not small enough, which is 
one of the requirements in our theory. But after a few iterations, the goal error reduces at 
certain rate, which was predicted by Theorem 5.48. This result confirms our theory. 

For comparison with the standard AFEM algorithm, we also show in Figure 3 the error 
reduction in H 1 semi-norm \u — Uh\ H i for both AFEM and GOAFEM algorithms with 
the same goal function given in (6.2). As one can see from Figure 3, reducing H 1 error to 
the same magnitude with both algorithms, the GOAFEM algorithm takes about 5 fewer 
refinement steps than the standard AFEM algorithm. 



7. Conclusion 

In this article we developed convergence theory for a class of goal-oriented adaptive 
finite element algorithms for second order semilinear elliptic equations. We first intro- 
duced several approximate dual problems, and briefly discussed the target problem class. 
We then reviewed some standard facts concerning conforming finite element discretiza- 
tion and error-estimate-driven adaptive finite element methods (AFEM). We included a 
brief summary of a priori estimates for semilinear problems, and then described goal- 
oriented variations of the standard approach to AFEM (GOAFEM). Following the re- 
cent work of Mommer-Stevenson and Hoist-Pollock for linear problems, we established 
contraction of GOAFEM for the primal problem. We also developed some additional 



22 M. HOLST, S. POLLOCK, AND Y. ZHU 



Error Reductions in H Norm 



— * — H' Error (or GOAFEM 
> H 1 Error (or AFEM 
■ - N" 1ffl 




Figure 3 . The reduction rate in the H 1 norm of the error u — Uh- 

estimates that make it possible to establish contraction of the combined quasi-error, and 
showed convergence in the sense of the quantity of interest. Some simple numerical 
experiments confirmed these theoretical predictions. Our analysis was based on the re- 
cent contraction frameworks for the semilinear problem developed by Hoist, Tsogtgerel, 
and Zhu and Bank, Hoist, Szypowski and Zhu and those for linear problems as in Cas- 
con, Kreuzer, Nochetto and Siebert, and Nochetto, Siebert, and Veeser. In addressing 
the goal-oriented problem we based our approach on that of Mommer and Stevenson for 
symmetric linear problems and Hoist and Pollock for nonsymmetric problems. How- 
ever, unlike the linear case, we were faced with tracking linearized and approximate dual 
sequences in order to establish contraction with respect to the quantity of interest. 

In the present paper we assume the primal and approximate dual solutions are solved 
on the same mesh at each iteration. The determination of strong convergence results for 
a method which solves the primal (nonlinear) problem on a coarse mesh and the dual on 
a fine mesh is the subject of future investigation. 
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